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Abstract 

We study the problem of obtaining efficient, deterministic, black-box polynomial identity 
testing algorithms (PIT) for read-once oblivious algebraic branching programs (ABPs). This 
class has an efficient, deterministic, white-box polynomial identity testing algorithm (due to Raz 
and Shpilka [RS05]), but prior to this work had no known such black-box algorithm. Here we 
obtain the first quasi-polynomial sized hitting sets for this class, when the order of the variables 
is known. This work can be seen as an algebraic analogue of the results of Nisan |Nis92| and 
Impagliazzo-Nisan-Wigderson |INW94| for space-bounded pseudorandom generators. 

We also show that several other circuit classes can be black-box reduced to read-once oblivi- 
ous ABPs, including set-multilinear ABPs (a generalization of depth 3 set-multilinear formulas), 
non-commutative ABPs (generalizing non-commutative formulas), and (semi-)diagonal depth-4 
circuits (as introduced by Saxena |Sax08] . and recently shown by Mulmuley [Mull 2 to have im- 
plications for derandomizing Noether's Normalization Lemma). For set-multilinear ABPs and 
non-commutative ABPs, we give quasi-polynomial-time black-box PIT algorithms, where the 
latter case involves evaluations over the algebra of (D + 1) x (D + 1) matrices, where D is the 
depth of the ABP. For (semi-)diagonal depth-4 circuits, we obtain a black-box PIT algorithm 
(over any characteristic) whose run-time is quasi-polynomial in the runtime of Saxena's white- 
box algorithm, matching the concurrent work of Agrawal, Saha, and Saxena |ASS12] , Finally, 
by combining our results with the reconstruction algorithm of Klivans and Shpilka [KS06 , we 
obtain deterministic reconstruction algorithms for the above circuit classes. 



1 Introduction 

Let C be an arithmetic circuit in the input variables x±, . . . ,x n , over a field F. The output 
C(x\, . . . ,x n ) is a polynomial / in the ring ¥[xi, . . . ,x n ]. The polynomial identity testing (PIT) 

problem is to efficiently determine "/ = 0" . In particular, we are asking if the formal expression 
/, as a polynomial in ¥[x±, . . . ,x n ], is zero. Schwartz and Zippel |Zip79, Sch80] showed that if 



/ / G ¥[x\, . . . , x n ] is a polynomial of degree < d, and a%, . . . , a n E S C F are chosen uniformly 
at random, then f{ot\, . . . ,a n ) = with probability at mos {]< d/\S\. Thus, given the circuit C, 
we can perform these evaluations efficiently, giving an efficient randomized procedure for answering 
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"/ = 0". An important open problem is to find a derandomization of this algorithm, that is, to 
find a deterministic procedure for PIT that runs in polynomial time (in the size of the circuit C). 

One interesting property of the above randomized algorithm of Schwartz-Zippel is that the 
algorithm does not need to "see" the circuit C. Namely, the algorithm only uses the circuit to 
compute the evaluations f(a\, . . . ,a n ). Such an algorithm is called a black-box algorithm. In 
contrast, an algorithm that can access the internal structure of the circuit C is called a white-box 
algorithm. Clearly, the designer of the algorithm has more resources in the white-box model and 
so one can expect that solving PIT in this model should be a simpler task than in the black-box 
model. 

The problem of derandomizing PIT has received a lot of attention in the past few years. In 
particular, many works examine a particular class of circuits C, and design PIT algorithms only for 
circuits in that class. One reason for this attention is the strong connection between deterministic 
PIT algorithms for a class C and lower bounds for C. This connection was first observed by Heintz 
and Schnorr |HS80| (and also by Agrawal | Agr05]) for the black-box model and by Kabanets and 
Impagliazzo [KI04] for the white-box model (see also Dvir, Shpilka and Yehudayoff [DSY09]). 
Another motivation for studying the problem is its relation to algorithmic questions. Indeed, the 
famous deterministic primality testing algorithm of Agrawal, Kayal and Saxena [AKS04J is based 
on derandomizing a specific polynomial identity. Finally, the PIT problem is, in some sense, the 
most general problem that we know today for which we have randomized coRP algorithms but no 
polynomial time algorithms, thus studying it is a natural step towards a better understanding of 
the relation between RP and P. For more on the PIT problem we refer to the survey by Shpilka 
and Yehudayoff fSYTU] . 

Although the white-box model seems to be simpler than the black-box model, for most models 
for which a white-box PIT algorithm is known, also a black-box PIT algorithm is known, albeit 
sometimes with worse parameters. Such examples include depth-2 circuits (also known as sparse 
polynomials) |BOT881 iKSOl] . depth-3 E)IE(fc) circuits [SSTT] . Read-fc formulas |AvMVllj and 
depth-3 tensors (also known as depth-3 set-multilinear circuits) [RS05, FS12J. While the running 
time of the algorithms for depth-2 circuits and £II£(&;) circuits are essentially the same in the white- 
box and black-box models, for Read-A; formulas and set-multilinear depth-3 circuits we encounter 
some loss that results in a quasi-polynomial running time in the black-box model compared to a 
polynomial time algorithm in the white-box model. 

Until this work, the only model for which an efficient white-box algorithm was known without 
a (sub-exponential) black-box counterpart was the model of non-commutative arithmetic formulas, 
or, more generally, the models of non-commutative algebraic branching programs (ABPs) and set- 
multilinear algebraic branching programs [RSQ5j (see Subsection 1.1 for definitions). 

The main result in this paper is a quasi-polynomial time PIT algorithm in the black-box model 
for read-once oblivious algebraic branching programs. Using this result we obtain black-box algo- 
rithms of similar running times for the models of set-multilinear ABPs, non-commutative ABPs, as 
well as for diagonal circuits as defined by Saxena [Sax08j. Although exponential lower bounds are 
known for these models, we note that the algebraic hardness- versus-randomness result of Kabanets 
and Impagliazzo |KI04| (as well as the extension of this result by Dvir, Shpilka and Yehuday- 
off [DSY09J to low-depth circuits) does not imply a black-box PIT algorithm for the model, since 
their technique does not work for the restricted models studied here. 

Although the models which are considered here may seem a bit unnatural at first sight, we note 
that the non-commutative model received a lot of attention in the algebraic complexity literature 
(mostly in the last few years) as it gives a good starting point for understanding the hardness 
of computing the determinant and permanent |Ms91l ICWl IAJS091 IXSTUl IHWYlOal ICHSS11L 
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IHWYlOb] IHY12j . In addition, we show in the paper that the model of set-multilinear ABPs 
generalizes the so called model of diagonal circuits that was introduced by Saxena |Sax08| . Recently, 
Mulmuley |Mull2| has shown a connection between derandomizing PIT for diagonal circuits and 
the problem of derandomizing Noether's normalization lemma. We discuss this in more detail 
in Subsection 1.3 We also note that derandomizing PIT in the general ABP model is a very 
important question. Specifically, the work of Ben-Or and Cleve [BOC92J implies that black-box 
derandomization of PIT, even for width-3 ABPs, would imply derandomization of PIT for general 
arithmetic circuits of logarithmic depth and in general, a quasi-polynomial time derandomization of 
PIT for poly-size arithmetic circuits. Saha, Saptharishi and Saxena [SSS09] showed that black-box 
derandomization of PIT of width-2 ABPs implies derandomization of PIT for depth-3 circuits, a 
model for which our knowledge is quite limited at the moment. 

Besides the natural goal to obtain the most general possible PIT result, the problem of obtaining 
a black-box version of the algorithm of Raz and Shpilka [RS05] is interesting because of the technique 
used there. Roughly, the PIT algorithm of |RS05| works for any model of computation that out- 
puts polynomials whose space of partial derivatives is (relatively) low-dimensional. Set-multilinear 
ABPs, non-commutative ABPs, low-rank tensors, and so called pure arithmetic circuits (defined in 
Nisan and Wigderson [NW96J ) are all examples of algebraic models that compute polynomials with 
that property, and so the algorithm of [RS05] works for them. In some sense using information on the 
dimension of partial derivatives of a given polynomial is the most applicable technique when trying 
to prove lower bounds for arithmetic circuits (see e.g., [Nis9 1 \ INW96} IRaz061 IRaz09"l IRS Y08| IRY09 j ) 
and so it was an interesting problem to understand whether this powerful technique could be carried 
over to the black-box setting. Prior to this work it was not known how to use this low-dimensional 
property in order to obtain a small hitting set, and this paper achieves this goal. Earlier, in [FS12J, 
we obtained a black-box algorithm for the model of low-rank tensors, but could not prove that it 
also works for the more general case of set-multilinear ABPs (we still we do not know whether this 
is the case or not - we discuss the similarities and differences between this and our new algorithm 
in Subsection 1.3), so this work closes a gap in our understanding of such low-dimensional models. 



We shall now define the model and state our results. 



1.1 The model 

The model that we consider in this paper is that of set-multilinear algebraic branching programs. 
In fact, we will work with a slightly more general model, but we first describe the model of set- 
multilinear ABPs. 

Algebraic branching programs were first defined in the work of Nisan )Nis91| who proved expo- 
nential lower bounds on the size of non-commutative ABPs computing the determinant or perma- 
nent polynomials. 

Definition 1.1 (Nisan). An algebraic branching program (ABP) is a directed acyclic graph 
with one vertex of in-degree zero, which is called the source, and one vertex of out-degree zero, 
which is called the sink. The vertices of the graph are partitioned into levels numbered Q,...,D. 
Edges may only go from level i — 1 to level i for i = 1, . . . , D. The source is the only vertex at level 
and the sink is the only vertex at level D. Each edge is labeled with a affine function in the input 
variables. The width of an ABP is the maximum number of nodes in any layer, and the size of the 
ABP is the number of vertices. 

Each directed source-sink path in the ABP computes a polynomial, which is a product of the 
labels on the edges in that path. As this work concerns non-commutative computation, we specify 
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that the product of the labels is in the order of the path, from source to sink. The ABP itself 
computes the sum of all such polynomials. 

We consider a slight variation of this model which we call set-multilinear ABP, in line with 
the term coined by Nisan and Wigderson [NW96]. In the set-multilinear scenario the variables are 
partitioned into sets 

I = IlUl 2 U'--Ul D , 

where 

A set-multilinear monomial is a monomial of the form 

— •Cljii ' -^2,12 ' ' ' •^D,t£>' 

In words, a set-multilinear monomial is a multilinear monomial that contains exactly one variable 
from each Aj. A set-multilinear polynomial is a polynomial consisting of set-multilinear monomials. 
In other words, the coefficients of a set-multilinear polynomial can be viewed as map from [n] to 
the field F, and thus is an D-dimensional tensor. 

Definition 1.2 (Set-multilinear ABP). A set-multilinear algebraic branching program 

(ABP) in the variable set X = X\ U X2 U • • • U Xr> is an ABP of depth D, such that each 
edge between layer i — 1 and layer i is labeled with a (homogeneous) linear function in the variable 
set Xi . 

It is clear from the definition that a set-multilinear ABP computes a set-multilinear polynomial. 
It is also not hard to see that any set-multilinear polynomial can be computed by a set-multilinear 
ABP. 

In fact, our result holds for the following model, that we call read-once oblivious ABPs, as well. 

Definition 1.3 (Read-Once Oblivious ABP). An read-once oblivious ABP in the variable set 
X = {xi, . . . ,X£>} is an ABP of depth D, such that each edge between layer i — 1 and layer i is 
labeled with a univariate polynomial in Xi of degree < n. 

Note that unlike previous definitions, in read-once oblivious ABPs we allow edges to be labeled 
with arbitrary univariate polynomials (with a bound of n on the degree) and not just with linear 
forms. Observe that be mapping Xij -H- x\ transforms any set-multilinear ABP into an read-once 
oblivious ABP and vice versa (when we index j starting at zero). 

1.2 Our results 

A black-box PIT algorithm is also known as an explicit hitting set, which we now define. We phrase 
the definition in generality to capture the notion of hitting sets for non-commutative polynomials, 
generalizing the usual notion. 

Definition 1.4 (Hitting Set). Let C be a class of non- commutative n-variate polynomials, with 
coefficients in F. Let 1Z be a non- commutative ring with a (commutative) ring homomorphism 
F — > 1Z, so that polynomials in C are defined over 1Z. A set % C lZ n is a hitting set for C if for 
allfen,f = Oifff\ n = 0. 

The hitting set T~L is t{n)- explicit if given an index into %, the corresponding element ofri can 
be computed in t(n)-time. 
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When C is commutative, we will always use 1Z = F, and when C is non-commutative, we will 
take 7Z = ¥ mxm for some appropriate m. 

Our main result is a quasi-polynomial time black-box PIT algorithm for read-once oblivious 
ABPs. 



Theorem (Theorem 3.22, PIT for read-once oblivious ABPs). Let C be the set of D-variate poly- 
nomials computable by width r, depth D, individual degree < n read-once oblivious ABPs. If 
|F| > po\y(D, n, r), then C has a poly(D, n, r)- explicit hitting set, of size < poly (D,n,r)°^ D ). 

We remark here that our result assumes that the read-once oblivious ABP takes the order x\ < 
■ ■ ■ < x n , and that our results do not hold under permutation of the variables. In Subsection 3.1 



we explain our proof technique and give an overview of the proof. Using this theorem we obtain 
black-box PIT algorithms for several related models. We first observe that PIT for set-multilinear 



ABPs is an immediate corollary of Theorem 3.22 As with read-once oblivious ABPs, we assume 
that we know the partition of the variables into the D sets, and our results do not hold under 
permutation of the variables. 



Theorem ( |Theorem 3^23| PIT for set-multilinear ABPs). Let X = X l UX 2 U- ■ -UX D , where Xi = 
{xn, . . . ,Xi tn }, be a known partition. LetC be the set of set-multilinear polynomials f(X±, . . . ,Xd) : 
¥ nD — > ¥ computable by a width r, depth D, set-multilinear ABP. If |F| > poly(D, n, r), then C has 
a poly (D,n,r)- explicit hitting set, of size < po\y(D , n, r) ^ D \ 



Next, we consider the model of non-commutative ABPs (see Section 4). Raz and Shpilka [RSOS^ 
gave a polynomial time white-box PIT algorithm for this model and we obtain a quasi-polynomial 
time black-box PIT algorithm. The evaluation points in our hitting set are vectors of (D+l) x (D+l) 



matrices for ABPs of depth D. We explain this choice in Section 4 In contrast to the above two 
results, this result for non-commutative ABPs makes no assumption about the variable ordering, 
and thus still holds under permutation of the variables. 



Theorem (Corollary 4.7 Black-box PIT for non-commutative ABPs). Let NC be the set of n- 
variate non- commutative polynomials computable by width r, depth D ABPs. If\¥\ > po\y(D,n,r), 
then NC has a poly (D,n,r)- explicit hitting set over (D + 1) x (D + 1) matrices, of size < 
poly {D,n,r)°^ D \ 



Saxena [Sax08] defined the model of diagonal circuits (see Section 5 for definition) in an attempt 



to capture some of the complexity of depth-4 circuits. Relying on the algorithm of [RS05], Saxena 
obtained a polynomial time white-box PIT algorithm for this model (for a certain setting of the 
parameters). Saha, Saptharishi and Saxena [SSS11], with the essentially same techniques, later 
extended Saxena's white-box results to the so-called semi-diagonal model. Simultaneously and 
independently of our work, Agrawal, Saha and Saxena [ASS12j gave a black-box PIT algorithm 
for semi-diagonal depth-4 circuits that runs in quasi-polynomial time in the runtime of the known 
white-box algorithm, and works over fields of large characteristic. 

Using our main theorem and a new reduction from diagonal circuits to read-once oblivious ABPs 
(Lemma 5.8) we obtain a PIT algorithm for diagonal circuits that runs in time quasi-polynomial in 



the white-box algorithm given by Saxena. Further, we do so over any characteristic in a unified way. 
As with our result on non-commutative ABPs, this result makes no assumptions about variable 
order. With essentially no modification, we obtain similar results for semi-diagonal circuits. 



Theorem (Theorem 5.11, Black-box PIT for semi-diagonal circuits). Let ¥ be a field of arbitrary 



characteristic. Let SVC be the set of n-variate polynomials computable by semi-diagonal depth- 
4 circuits, that is, of the form = J2ie[k] m i(x) ' PiT ' ' " > where mi(x) is a monomial of 
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degree < d, n^Li(l + e «,j) — e an ^ * s a sum °/ univariate polynomials of degree < d T/ien j/ 
|F| > poly(n, k, d, e) t/ien 52?C has a poly(n, k, e, d) -explicit hitting set of size < poly(n, d, k, e)°^ n \ 

Klivans and Shpilka }KS06| gave a polynomial-time learning algorithm for read-once oblivious 
ABPg^J given membership and equivalence queries. That is, they give a deterministic algorithm 
that can learn an unknown / computed be a small read-once oblivious ABP, given a oracle that 
evaluates / ("membership query"), as well an oracle such that for any hypothesis h computed by 
a small read-once oblivious ABP with h 7^ /, the oracle returns an evaluation point x such that 
h{x) 7= f{x) ("equivalence query"). Membership queries can be implemented deterministically 
given black-box access to /, and equivalence queries can be implemented using random queries. 
That is, by appealing to the Schwartz-Zippel algorithm, distinguishing h 7^ / can be done using 
random evaluations. 

Our above results construct hitting sets for read-once oblivious ABPs, and as these are closed 
under subtraction (that is, for equivalence queries, h — f is also a small read-once oblivious ABP), 
our hitting set will always contain a distinguishing evaluation for h and /, if h 7^ /. Thus, we 
can derandomize the equivalence queries needed for |KS06j . Further, our results on set-multilinear 
ABPs, non-commutative ABPs and semi-diagonal depth-4 circuits, all rely on reductions to read- 
once oblivious ABPs, and these reductions also hold in the learning setting. We omit further details, 
and state the following result. 

Theorem 1.5 (Extension and Derandomization of [KS06]). There is a deterministic polynomial- 
time learning algorithm from membership and equivalence queries that can learn each of the follow- 
ing models: read-once oblivious ABPs, set-multilinear ABPs, non- commutative ABPs and semi- 
diagonal depth-4 circuits. In the first three models the algorithm outputs an hypothesis from the 
same class as the unknown function, but in the case of semi-diagonal depth-4: circuits the outputted 
hypothesis is a read-once oblivious ABP. Given such query access, the running time of the algorithm 
is polynomial in the siz^\ of the underlying branching program or circuit. 

Furthermore, such membership and equivalence queries can be implemented in randomized- 
polynomial time, or in deterministic quasi-polynomial time, for any of the classes mentioned above. 



1.3 Related work 

In |Mull2| Mulmuley studies the problem of derandomizing Noether's normalization lemma. 
Roughly, this lemma says that, for an algebraically closed field K and a projective variety 
V C P(K S ) (P(K S ) is the projective space over K s ) of dimension n, if we pick a linear trans- 
formation ^ : K s — > K m at random for m > n, then ^ will be well-defined on V. Furthermore, 
for any v G V the set (ty(v)) n V, for such a random ty, is finite. The complexity theoretic 
question associated with the lemma, is given the set of polynomials defining V, how to explicitly 
construct the map ^. In fact, even given ^, it is not clear how to verify that it satisfies the required 
properties. In his paper Mulmuley considers two explicit varieties. The one that is related to di- 
agonal circuits is the coordinate ring K[V] of a polynomial representation V of G = SL m (K). An 
equivalent formulation of Noether's normalization lemma says that there exists a set S C K[y] G ' 
(where K[V r ] G is the subring of G-invariants in K[V]) consisting of poly(n) homogeneous invari- 
ants such that K[1/] G is integral over the subring generated by S. Again, a random choice of S 
has the required property, but deterministic construction of such a small S is not known (nor an 

2 The terminology used in [KS06] is that of learning by multiplicity automata, but these are completely equivalent 
to read-once oblivious ABPs, as shown in [KS06 . 

3 For convenience here, we define the "size" of a semi-diagonal depth-4 circuit to be some poly(n, k, e, d), using the 



notation of Theorem 5.11 even though the actual circuit could be much smaller. 
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efficient way of certifying whether a given S is good, see |Mull2| for a more detailed discussion). 
Theorem 1 . 1 of |Mull2j shows that black-box derandomization of PIT for diagonal circuits implies 
derandomization of Noether's lemma in the sense that one can construct such a set S efficiently. 
Moreover, Mulmuley views this connection as an evidence that derandomization of black-box PIT 
of even diagonal circuits is a hard problem. To support this view |Mull2j mentions that, currently, 
the best deterministic algorithms are doubly exponential in nj^] 



In contrast to this view, our Theorem 5.9 implies that for the variety studied in |Mull2j the set 
S can be constructed in quasi-polynomial time. Thus, instead of looking at the results in |Mull2j 
as indicating hardness of PIT, we suggest to view them as offering algorithms for several problems 
of giving explicit constructions. 

Theorem 1.2 of |Mull2] gives derandomization of Noether's normalization lemma under certain 
assumptions such as EXP containing a function with a sub-exponential lower bound, or assuming 
that EXP is not contained in MA and that the generalized Riemann hypothesis holds. In both cases 
the implied running time is at least quasi-polynomial so our result achieves this without further 
assumptions. 

It is interesting to note that to get a quasi-polynomial sized hitting set for depth-3 diagonal 
circuits one does not need our new result and in fact, such a hitting set can be constructed using 
the techniques of Shpilka and Volkovich {SV09J . 

The reader is encouraged to read |Mull2] to learn more on this connection. 

While the work of Raz and Shpilka [RS05J establishes a white-box PIT algorithm for non- 
commutative ABPs, the black-box PIT question for this model is more intricate since one cannot 
immediately apply the usual Schwartz-Zippel algorithm over non-commutative domains. How- 
ever, Bogdanov and Wee [BW05] showed how, leveraging the ideas in the Amitsur-Levitzki theo- 
rem [AL50] . one can reduce non-commutative black-box PIT questions to commutative black-box 
PIT questions. By then appealing to Schwartz-Zippel, they give the first randomized algorithm for 
non-commutative PIT. They also discussed the possibility of derandomizing their result and raise a 
conjecture that if true would lead to a hitting set of size ~ s log s for non-commutative ABPs of size 
s. Our Theorem 4.6 gives a hitting set of size ~ s logs and does not require unproven assumptions. 



In particular, their conjecture asked about the minimal size of an ABP required to computed 
a polynomial identity of n x n matrices. Recall that a polynomial identity of matrices is a non- 
zero polynomial f(x) such that no matter which n x n matrices we substitute for the variables 
Xi, f evaluates to the zero matrix. Our work bypasses this conjecture, in that we more directly 
translate the non-commutative polynomial into a (set-multilinear) commutative polynomial. Our 
construction consists of (D + 1) x (D + 1) strictly-upper-triangular matrices for depth D non- 
commutative ABPs. It is also not hard to see that there is a non-commutative formula of depth 
D + 1 which is a polynomial identity for the space of (D + 1) x (D + 1) strictly- upper-triangular 
matrices. Thus, our result is tight in that it also illustrates that if we go just one dimension up 
above what is obviously necessary then we can already construct a hitting set. 

In |AMS10| . Arvind, Mukhopadhyay and Srinivasan gave a deterministic black-box algorithm 
for identity testing of sparse non-commutative polynomials. The algorithm runs in time polynomial 
in the number of variables, degree and sparsity of the unknown polynomial. This is similar to the 
running time achieved in the commutative setting for sparse polynomials (see e.g., [BOT88, KSOlJ) 
and in particular it is better than our quasi-polynomial time algorithm. On the other hand our 
algorithm is more general and works for any non-commutative polynomial that is computed by a 



4 More accurately, the doubly-exponential bound is for a general variety V. Because of the reduction to PIT there 
is a single exponential time algorithm for the specific varieties that he is considering. 
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small ABP. 

We note that in the aforementioned [AMSlOj . the authors showed how to deterministically 
learn sparse non-commutative polynomials in time polynomial in the number of variables, 
degree and sparsity. In contrast, for such polynomials our deterministic algorithm requires 
quasi-polynomial time. For general non-commutative ABPs [AMSlOj also obtained a deter- 
ministic polynomial time learning algorithm, but here they need to have the ability to query 
the ABP also at internal nodes and not just at the output node. Our deterministic algorithm 
runs in quasi-polynomial time but it does not need to query the ABP at intermediate computations. 

In a previous paper |FS12| we gave a hitting set of quasi-polynomial size for the class of depth-3 
set-multilinear polynomials. That result is mostly subsumed by the generality of Theorem 3.23[ 



but the previous work has a better dependence on some of the parameters. More importantly, it is 
interesting to note that the proof in [FS12j is also based on the same intuitive idea of preserving 
dimension of linear spaces while reducing the number of variables, but in order to prove the results 
in this paper we had to take a different approach. At a high level the difference can be described 
as follows. We now summarize the algorithm of |FS12| . First the case of degree 2 is solved. In 
this case the tensor is simply a bilinear map. For larger D, the algorithms works by first reducing 
to the bivariate case using the Kronecker substitution xi x n% for i < D/2 and V nl f° r 

1 < i < D/2. Appealing now to the bivariate case, we can take y to be some multiple of x, and 
then undo the Kronecker substitution (and applying some degree reduction) to recover a tensor 
on D/2 variables. If we try to implement this approach with ABPs then we immediately run into 
problems, as the previous work requires that the layers of the ABP are commutative, in that we 
can re-order the layers. While this is true for depth-3 set-multilinear computation, it is not true 
for general ABPs. To generalize the ideas to general ABPs, this work respects the ordering of the 
layers in the ABP. In particular, while the previous work had a top-down recursion, this work 
follows a bottom- up recursion. We merge variables in adjacent levels of the ABP and then reduce 
the degree of the resulting polynomial using an (algebraic rank) extractor. We do this iteratively 
until we are left with a univariate polynomial. Perhaps surprisingly, this gives a set that is simpler 
to describe as compared to the hitting set in [FS12J. On the other hand, if we restrict ourselves to 
set-multilinear depth-3 circuits then the hitting set of |FS12| is polynomially smaller than the set 
that we construct here. 



Independently and simultaneously with this work, Agrawal, Saha and Saxena jASS12j ob- 
tained results on black-box derandomization of PIT for small-depth set-multilinear formulas, 
when the partition of the variables into sets is unknown. They obtained a hitting set of size 
exp((2/i 2 log(s)) /l+1 ), for size s formulas of multiplicative-depth h. We note that their model is 
both stronger and weaker compared to the models that we consider here. On the one hand, this 
model does not assume knowledge of the partition of the variables, whereas our model assumes 
this knowledge. On the other hand, they only handle small-depth formulas, and indeed, the size 
of their hitting grows doubly-exponentially in the depth, whereas our results handle arbitrary 
set-multilinear formulas, according to their definition, if we know the partition, and the size of 
the hitting set grows quasi-polynomially with the depth^} Using their results for set-multilinear 
formulas, Agrawal, Saha and Saxena |ASS12j also give a quasipolynomial-sized hitting set for 
semi-diagonal circuits over large characteristic fields. Our results, in particular our extension of 
Saxena's [Sax08j duality to all fields, can help extend the results of [ASS12J to small characteristics 

5 Their definition of set-multilinear formulas is actually that of a pure formula (see [NW96] ). It is not hard to 
see that pure formulas form a sub-model of the set-multilinear ABPs we examine in this paper. That is, the usual 
transformation from formulas to ABPs can be done preserving set-multilinearity. 



8 



as well. 



We also mention that in (JQS09 [JQS10 Jansen, Qiao and Sarma studied black-box PIT in 
various models related to algebraic branching programs. Essentially, all these models can be cast 
as a problem of obtaining black-box PIT for read-once oblivious branching programs where each 
variable appears on a small number of edges. Their result gives a hitting set of size (roughly) 
n O(feiog(fc)iog(n)) wnen j s an U pper bound on the number of edg es that a variable can label and the 
ABP is of polynomial size. In comparison, our Theorem 3.22 gives a hitting set of size n°^° g ( n ^ 
and works for k = poly(n) (as long as the size of the ABP is polynomial in n). Our techniques 
are very different from those of JQS09| [JQS10| , which basically follow the proof technique of [SV09J. 



This work fits into the research program of derandomizing PIT, in particular derandomiz- 
ing black-box PIT. However, for many of the models of algebraic circuits studied, there are cor- 
responding boolean circuit models for which derandomization questions can also be asked. In 
particular, for a class C of boolean circuits, we can seek to construct a pseudorandom generator 
Q : {0, 1} S — > {0, l} n for C, such that for any circuit C G C on n inputs, we have the e-closeness of dis- 
tributions C(Q(U S )) ss e C(U n ), where [/& denotes the uniform distribution on {0, l} k . Nisan |Nis92 
studied pseudorandom generators for space-bounded computation, and for space S computation 
gave a generator with seed length s = C(lg 2 S). Impagliazzo, Nisan and Wigderson [INW94] later 
gave a different construction with the same seed length. Randomized space-bounded computation 
can be modeled with read-once oblivious (boolean) branching programs, and these generators apply 
to this model of computation as well. Our work, at its core, studies read-once oblivious (algebraic) 
branching programs, and we achieve a quasi-polynomial-sized hitting set, and can thus be viewed 
as having a "seed length" of C(lg 2 S) for ABPs of size S. Aside from the similarities in the state- 
ments of these results, there are some similarities in the high-level techniques as well. Indeed, an 
interpretation by Raz and Reingold [RR99^ argues that the [INW94] generator for space-bounded 
computation can be seen as recursively partitioning the branching program, using an (boolean) ex- 
tractor to recycle random bits between the partitions. Similarly, in our work, we use an (algebraic 
rank) extractor between the partitions. At present, it is unclear if there are formal (or further 
informal) connections between the boolean pseudorandom generators and black-box PIT. 



2 Notation 

We write [n] to denote {1, . . . and [rt] to denote {0, ... ,n — 1}. We write lp to denote the 
indicator function for the set/event P. We use U to denote a disjoint union. 

Matrices and vectors will most often be indexed starting at zero. In observance of this indexing 
scheme, we will write to denote ra-dimensional vectors with entries in S, and will write .SHM" 1 ! 



to denote matrices of size n x m with entries in S. Indexing from 1 (as done in Section 5 ) will be 
indicated by the use of [n] as opposed to [n] . We denote I for the identity matrix, whose dimension 
will always be clear from the context. For a matrix M, we will write Afj , to denote the i-ih row of 
M, and M m j to denote the j-th column, where i and j are indexed from zero. Given a bit vector 
b G {0, ljW, we write bi for the i-th bit of b, where i is indexed from zero.. When n = 0, we use A 
to denote the empty string in {0, and thus for n > 0, |{0, 1}H | = 2 n . 

Given a vector of polynomials / G ¥[x] n and an exponent vector a G N n , we write f a for 

We write M G F[5]W X M to indicate that M is an r x r matrix, with entries in ¥[x\. The (total) 
degree of M, written deg(M), is defined as the maximum degree of its entries. Given a collection of 
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matrices Ai C FW X W ) span.M denotes the span of these matrices, in the vector space fM x IH = 
F^l Given M C fW x W and M C FH X H, we define A4 • N := {MTV : M G .M, iV G AA}. 
Given a polynomial / G F[x], we write Cjs(/) to denote the coefficient of f° in /. For a matrix 
M 6 F[x]M x W, we write £^a(M) to denote the r x r F-matrix, with the <tgs operator applied to 
each entry. When we write "/ G F[x][y]", we will treat / as a polynomial in the variables y, whose 
coefficients are polynomials in the variables x, and correspondingly will write to extract the 

polynomial in x that is the coefficient of the monomial y 13 in /. Given a polynomial / (in ¥[x] or 
in the non-commutative F{x}), we write Hk(f) for the homogeneous part of / of degree k. 

If Mi G FM X M for i G [-D], then we write Iljep] ^ f° r non-commutative multiplication 
Mq • • • Md-i, with the understanding that the multiplication proceeds left to right, starting at 
the lower index of the product (i = 0) and ending at the higher index (D — 1). When this non- 
commutative product is indexed over the range {0, 1}^, we order the product by the lexicographic 
order on {0, 1}M, where the least-significant bit is the rightmost bit. 

3 Hitting Set for Read-Once Oblivious ABPs 

In this section, we construct a hitting set for read-once oblivious ABPs. To make the questions 
more amenable to study, we give the following normal form for read-once oblivious ABPs. 

Lemma 3.1. Let A be a width < r, depth D, individual degree < n read-once oblivious ABP, 
computing a polynomial f € ¥[xo, xd-i]- Thenfori£ there are matrices Mi G ¥[xi]^ x ^ 
of degree < n such that 

fix) = ( n m 4 (x 4 ) 

Conversely, any such function can be computed by a width r, depth D, read-once oblivious ABP. 

Proof. ABP ^=^> matrices: Identify the nodes in the i-th layer of A with a subset of [r] x {i}, so 
that the nodes of A are identified with a subset of [r] x \D + lj and the source is x and the 
sink is x D. Then, define the matrices Mj such that the entry (Mj)^/ takes the label of the edge 
(I x (i — 1), (! x i). Note that such a label is a univariate polynomial in Xi of degree < n. It is 
straight-forward to see that / is computed by this matrix product. 

matrices =^ ABP: This is similar to the above. □ 

Thus, instead of talking about such ABPs, we will be discussing products of matrices Y\ i Mi(xi). 
For induction purposes we will not be restricting ourselves to the (0, 0)-th entry, but will consider 
the full matrices. Such products naturally correspond to multi-source multi-sink ABPs. With this 
normal form in mind, we can give the proof overview. 

3.1 Proof overview 

At a high level, our hitting set can be viewed as a repeated application (in parallel) of a "derandom- 
ized Kronecker product". That is, the usual Kronecker product allows us to merge two variables 
(that is, we merge x and y by taking y <— x m for large enough m) of a polynomial without losing any 
information. Once a single variable remains, one can resort to full-interpolation of this univariate 
polynomial. However, this generic transformation rapidly increases the degree of the polynomial 
and thus the final interpolation step requires too many evaluations to yield a good hitting set. 
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For our hitting set, we first observe that in an read-once oblivious ABP each layer has its own 
variable, so merging two variables in adjacent layers results in the merging of the two layers in the 
ABP. We then give a degree-reduction procedure for the results of such merges. That is, given two 
adjacent layers of our ABP, M(x) G F[i]W x W and N(y) G F[y]M x M, we first set y <- x n (where 
degM, N < n) to obtain M(x)N(y) ~ M(x)N(x n ), where the right-hand term is of degree ~ n 2 . 
We then construct f,g £ ¥[z] of degree < r 2 such that M(x)N(x n ) « M(f(z))N(g(z)), where the 
degree of the right-hand term is now ~ nr 2 . Thus we have that M{x)N{y) ~ M(f(z))N(g(z)), 
and we have only increased the degree by ~ r 2 . As r, the dimension of the matrices (and the width 
of the ABP), stays constant throughout this merging process, one can repeat this merging process 
and have the degree of the polynomial scale linearly with the number of mergings. To then fully 
leverage this idea, we show that pairs of variables can be merged in parallel. 

To discuss our degree-reduction process, we first need to state what it means for ll M(x)N(x n ) ~ 
M{f(z))N(g(z)y\ In the Kronecker substitution, this equivalence typically means that there is a 
bijection between monomials. We will not use that notion, and instead will us a problem-specific 
notion, related to linear algebra. Specifically, for each a, f3 G F, the matrix M{a)N{(5) is some linear 
transformation F^ — > F^. Ranging over all a and f3, we get a space of such transformations, 
which we extend by linearity to a vector space of transformations. Crucially, the polynomial 
M{x)N(y) is zero iff this vector space is zero. More generally, we will show that this linear space 
of transformations captures essentially all we need to know about the polynomial M(x)N(y). Our 
degree reduction lemma (using the Kronecker substitution, and then doing the degree reduction) 
finds two curves f,g G ¥[z] of degree < r 2 such that the space of linear transformations induced 
by M(x)N{y) is the same as the space of linear transformations induced by M{f{z))N(g(z)). It 
follows then that M{x)N(y) ~ M(f(z))N(g(z)), and so we have merged two variables without 
increasing the degree too much. 

To find such curves, we use a (seeded) rank extractor. That is, the n 2 matrices defined by 
the coefficients of M(x)N(y) live in a r 2 -dimensional space, so we seek to map these n 2 vectors to 
a smaller space while preserving rank. Gabizon and Raz [GR05] established such a lemma, and 
our prior work [FS12J improved the parameters in their lemma. Crucially, these rank extractors 
have the form such that the maps they define correspond to polynomial evaluations. Thus, these 
extractors establish (for most values of the seed) an explicit small set of evaluation points where 
the span of M(x)N(y) is preserved. By creating curves that through these points, we obtain the 
desired / and g for our degree reduction. 

3.2 Derandomized Kronecker Product for the Span of an ABP 

We will begin the formal statements of this section by giving lemmas on the span of matrices, and 
how preserving the span of linear transformations can be used for PIT. We begin by showing that 
the span of ABPs can be preserved in a recursive fashion. 

Lemma 3.2. Let M,M' C fM x W and Af,Af' Q F^xH, such that spanX = spanA4' and 
spanAA = spanAA'. Then span(A^ - A/") = span(span(A4) • span (A/")) and span(A4 - A/") = span(A4' • 
A/ 7 ). 

Proof. As M. ■ M C span(A / l) • span(A/") we see that span(A / ( • M) C span(span(A4) • span(AA)) 
by linearity. To show the other direction, consider any A G span(span(A4) • span(AA)). Then 
A = 12i c i(^2j a i,j M j)(^2k b i,k N k), for Mj G M and N k G N. Thus, by bilinearity of the matrix 
product, A = Yli,j,k °i a i,jbi,kMjNk, where MjNf. G Ad -AT, and thus A G span(A4 • Af), yielding the 
first claim. 



11 



The second claim follows from the first, by observing that span(.M • J\f) = span(span(.A/f ) • 
span(AO) = span(span(.M') • span(AA')) = span(^C • TV')- □ 

We will use this as follows. By |Lemma 3.1| we can restrict our attention to matrix-valued 
functions of the form f(x) = Hie[-D] and we work in this level of generality for ease of 

induction. As such, / defines a space of matrices A by evaluating the polynomial on all possible 
inputs. Crucially for PIT, we see that span(^l) is zero iff / is zero. By breaking the matrix product 
in / into variable disjoint left- and right-halves (each on ~ D/2 variables), we can see that this 
space of matrices factorizes into A = Ao • A\, corresponding to the evaluations of the separate 
halves of the ABP. Now, because these halves are variable disjoint, we can in parallel find smaller 
spaces of matrices such that span(^4' c ) = span(^4 c ) for c G {0,1}. From the above lemma, we 
now see that span(„4) = span(„4.Q • A[). The operation "Aq • A'i" squares the number of matrices 
under consideration, so to complete the picture we "derandomize" this Kronecker product to yield 
a "small" family of matrices A' such that span(„4) = span(^4'). It follows then that PIT of the 
original function / reduces to testing each matrix in A' for non-zeroness, which will correspond to 
fully interpolating a univariate polynomial. 

To implement the above program, our main challenge is to thus construct the derandomized 
Kronecker product. To do so, our next lemma shows that the span of a layer of an read-once 
oblivious ABP can be viewed either as the span of the evaluations of that layer, or of the (finite) 
number of (matrix- valued) coefficients of that layer. We will use both characterizations of the span 
of a layer. 

Lemma 3.3. Let M G F[x]M><H. Then for any set S CF with \S\ > deg(M), span{M(a)} oe5 = 
s P an{MM)}SW 

Proof C: Note that M(a) = E£q (M) ^A m ) ■ and thus is in span{£^(M)}gf M) , for any a. 

5: Let d = deg(M) + 1. As the span of a set of vectors is monotonic, it suffices to prove 
the claim when \S\ = d. Recall that polynomial interpolation shows that the map FM — > F^ 
defined by mapping degree < d polynomials to their evaluations on S is a bijective linear map. It 
follows that for any < i < d, there is a set of constants {ai,a}aes such that for any polynomial 
/ G ¥[x] of degree < d, £,*(/) = £ ag s a «, af(ct)- Applying this relation entry- wise we obtain 
<t x i(M) = Eae5 a i,aM(a), and thus <£ x i(M) G span{M(a)} ag s. Varying this over all i yields the 
result. □ 

We now cite a dimension reduction lemma that shows how we can map a vector space of rank 
< r, in a large ambient dimension n, to a vector space of the same rank in an ambient dimension 
r (and thus, is a "rank extractor"). A lemma of this sort was first established by Gabizon and 
Raz |GR05j . That lemma would also work for us, but the version we cite has slightly better 
parameters. 

Lemma 3.4 ( |F5T5p . Let 1 < s < r < n. Let M G FH><M be of rank s, for r' > s. Let K be 
a field extending ¥, and let u> G K be an element of order > n. For a G K define A a G KM x I n l 
by (A a )ij = (uj r ay . Then there are < nr — ( r ^ 1 ) < nr values a£l such that the first s rows of 
A a M have rank < s. 

We now apply this dimension reduction to read-once oblivious ABPs, by observing that even 
though the result M{x)N{x n ) of the Kronecker product induces a linear space of transformations 



most naturally parameterized by the ~ n 2 coefficients of this polynomial (as in Lemma 3.3), each 
transformation is an r x r matrix, and so this linear space of transformations more naturally lives 
in an r 2 -dimensional vector space. Thus, by applying dimension reduction, we make a first step in 
getting a derandomized Kronecker product. 
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Lemma 3.5. Let M G F[x]M><H ; N G %]H X M be of degree < n. Let u be an element of order 
> n 2 . Then, for any «6F, 

sp&n{<£ xly3 (M(x)N(y))} iJeM D sp&n{M{Ja)N((Ja) n )} Eelr 2 } 

and except for < n 2 r 2 values of a, 

sp&n{C xiyj (M(x)N(y))} itje[nl = sp&n{M{uj e a)N((Ja) n )} eelr 2 } 



Proof. 3: This follows from the same reasoning used in Lemma 3.3, in that M(uj a)N((uj a) n ) = 
Y^ijefn} < ^-x i yi (M(x)N(y)) • (u a) i+n: > , and thus the evaluations of M(x)N(y) are in the span of the 
coefficients of M(x)N(y). 

Consider the matrix C G F^M** 2 ! defined so that C i+n j t . := £ x i y3 (M(x)N(y)), where we 
flatten r x r matrices into r 2 -dimensional vectors. Observe, that as in the Kronecker substitution, 
the map — > i + nj is a bijection [n] 2 — > [re 2 ], and thus the rows of C are exactly the coefficients 
of M(x)N(y), and thus spanj^i^ (M( x)N(y))} i j e i n ^ l = row-span(C) 



Taking A a £ ¥^- r ^ x l n 1 as defined in Lemma 3.4, the lemma shows that except for < n 2 r 2 values 
of a, we have that rank(^4 Q C) = rank(C). Recall that (A a )i : i +n j = {uj l a) l+n ^ which implies that 

(A Q ) e ,.C= £ x iyi(M(x)N(y)).(Ja) i+n i = M{Ja)N{{Ja) n ), 

and thus spa.n{M(ui e a)N((uj i a)' n )}i £ j J .2j = row-span(74o,C). Since row-span(C) 5 row-span(^4 a C), 



it follows that if rank(C) = rank(A a C), then row-span(C) = row-span(^4 Q C). As Lemma 3. 4| shows 
there are < n 2 r 2 many a for which rank(C) > rank(^4 Q C), and thus there are < n 2 r 2 many a for 
which row-span(C) ^ row-span(A Q C), yielding the claim. □ 

The above result shows how to choose evaluation points for the two matrices M and N such 
that the span is preserved. However, by itself this does not lead naturally to recursion. Rather, 
we would like to take M(x)N(y) and produce a pair of curves f,g E ¥[z] satisfying M(x)N(y) ~ 
M{f(z))N(g(z)), such that we can apply recursion and merge the variable z with other variables. 
To do this, it is enough that / and g pass through all of the evaluation points for M and N 
in the above lemma, as then this captures the desired span of linear transformations. To create 
these curves, we need the following definition of the Lagrange interpolation polynomials, which 
will be featured in our hitting set. Note that these appeared in the prior work of Shpilka and 
Volkovich SY()!)|. who also used them in a somewhat similar fashion. 

Definition 3.6. Fix s, and distinct f3{ G ¥, for i G [s]. Then the Lagrange interpolation 
polynomials (with respect to s and the Pi's) are the unique polynomials pe G ¥[t] of degree 
< s such that Pe(f3i) = li=e for all i,£ G [sj. 

We now use these polynomials to construct curves passing through the evaluation points shown 
in Lemma 3.5, to yield a reduction of a two-layer read-once oblivious ABP to a one-layer read-once 



oblivious ABP. We will phrase the result more generally, so that we can apply induction. 

Lemma 3.7. For i G {Dj, let Mi G F[x]W x W, Ni G F[ l /]W X W be of degree < n, and f, L G ¥[x], 
Qi G ¥[y] be of degree < m. Let u G F be an element of order > (Dnm) 2 . Let j3j G F with j G [r 2 ] 
be distinct, and let pi be the corresponding Lagrange interpolation polynomials with respect to r 2 
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and the j3j 's. Then, except for < {Dnmr) 2 values of a, we have 



span < H Miifiix)) • J] N^iy)) 

Cspani J] Mi I Y, M" e a)Pi(z) ) • II N * [ E Si((^«) Dn ">(*) 



zeF 



Proof. Define R(x) := IliefD] Mi{fi{x)), so that i? G F[x]W x I r ] is of degree < Dnm, and define 
T{y) := UielDJ N i(9i(y)), so that T G %]MxH Q f degree < Dnm. As w has order > {Dnm) 2 , 



Lemma 3.5 implies that except for < {Dnmr) 2 values of a, 

sp&n{£ xiyj {R{x)T{y))} iJe[Dnm} = span{R{u} £ a)T{{uj e a) Dnm )} ee[r 2 } . 



As lo G F has order > (Dnm) 2 , it follows that |F| > {Dnm) 2 , so that applying Lemma 3.3 
(first to the variable x, then to the variable y) we have that 



span{R{x)T{y)} Xty& = spaxi{€ x iyj(R{x)T{y))} ide i Dri 



(3.8) 
twice 

(3.9) 



Now denote 



U{z) := H Mil Y, h{^a) Vl {£) and V{z) := II ^ E 9iii^) Dnm ) Pi {z) 
ielD] / ie[D] \*G[r2] 

which are both elements of F[z]M x M of degree < Dnr 2 . Then by construction of the Lagrange 
interpolation polynomials, we see that Uifii) = R{Ja) and V(J3t) = T{{u e a) Dnm ), and thus by 
linearity 

span{R{uj e a)T{{uj e a) Dnm )} ee[r 2 } C sp&n{U (z)V (z)} z& . (3.10) 

□ 



Putting equations (3.8), (p. 9ft, and (3.10) together yields the claim. 



Remark 3.11. In the above lemma it is tempting to ask for the "C" to be an "=", for in PIT 
applications we wish to avoid causing a zero polynomial to become a non-zero polynomial (or 
in this case, avoid causing a zero-dimensional span becoming higher-dimensional). However, the 
proof does not give this in general, as inserting the pi{z) polynomial outside the functions fi and 
gi allows us to potentially evaluate the matrices Mj and iV, at points possibly outside the range of 
the polynomials fi and gi, seemingly allowing the dimension to increase. 

Nevertheless, this lemma is enough for constructing a hitting set, as the resulting span is still 
inside that of Y\ i Mj(xj) • ]X Ni{y{), which is ultimately the polynomial whose span we are trying 
to replicate. In such applications, we will start off (by induction) with fi and gi such that 

span | H Mi{fi{x)) • H A^(y)) I = span j H Mi{ Xi ) ■ \{ Ni{ Vi ) I , 
Ue[D] ie[D] I lie[D] ie[D] I 



and so as the "C" in Lemma 3.7 nests between the two terms in this equality, we will in fact get 
"=" in the lemma in this case, so will not turn zero polynomials into non-zero polynomials. 
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3.3 The Generator for Read-Once Oblivious ABPs 

In this section we construct the hitting set for read-once oblivious ABPs. The hitting set is more 
naturally presented as a generator, which we now define. See also the survey |SY10j for more on 
generators. 

Definition 3.12. Let C be a class of circuits computing polynomials on n variables. A polynomial 
map Q : ¥" — > is a generator for C if for every polynomial f computed by a circuit in C, 

f = oifffog = o. 

Given a generator Q, one can show that Q{S t ) is a hitting set for C as long as \S\ is polynomially 



large in the relevant parameters. We present this formally in Lemma 3.21 Given this relation to 
hitting sets, we now proceed to the construction of our generator, and then prove the requisite 
properties. 

Construction 3.13. Let n, r > 1, d > 0, and D = 2 d . Let ¥ be a field of size |F| > (Dnr 3 ) 2 and 
let co G F be of order > (Dnr 2 ) 2 . Let ft eF with i G \r 2 } be distinct, and let p e G ¥[t] be the 
Lagrange interpolation polynomials with respect to r 2 and the ft 's. 

We define a polynomial map g d : Fl d+1 l -)■ Fl D l as follows. We index the inputs to Q d by the 
variables a% for i G \d + 1], and index the outputs of Qd by bit-vectors b G {0, 1}^, so that we write 
^db f or ^ e output associated with b. 

Define, 

SdjS" E II ((l-^O-^^^^ + ^-^.^^a,) 2 ^ 2 ))-^,^), (3.14) 

£ ,...A-i6lr 2 ]i6M 

where we abuse notation and define pt^it) = t. In particular, 

£ ,b Pi-A<xo) = a o- (3-15) 

We now establish properties of this construction. We first prove an easy lemma that gives an 
upper bound on the degrees of the variables «j in Q. 



Lemma 3.16 (Bounding the degree). Assume the setup of Construction 3.13. Let b G {0, 1} 



Fori G 14, deg Qi (G. r(a)) < Dnr 



4 



• For i = d, deg Q (Q , ?(&)) < r 2 < Dnr . 



d,b\ 
d,fe( 



Proof. Recall that deg(pe i ) < r 2 (even for pe_ 1 , as r > 1). Also, recall that 2* < D, for i G Id + lj. 
There are two cases, < i < d and i = d. For < i < d, we see that deg ai (G d ^) < deg(p£ i _ 1 ) • 
2 l nr 2 < Dnr 4 . For i = d, we see that deg ad (Q d ^) < deg(p£ dl ) < r 2 < Dnr 4 , as D,n, r > 1. □ 

The next lemma demonstrates the recursive structure of Q. 



Lemma 3.17 (The recursive structure). Assume the setup of Construction 3.13. For b G {0, 1}^ 1 H , 

bfi-i G {0, 1}, and a denoting the variables at for i G \d — 1], 
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Proof. This claim follows from the definitions, with some rearrangement of terms. 



E II (^-h)-p^M*a i ) + b l .p^ 1 {{J*a l ? 1 ™- )) ■ Ped _ 1 (a d ) 
...,4-ie[r 2 ]ieM 

E E II ((l-bi)-PtU^i) + h-PtU(^if nr2 ) 

t*-i„. . \ , J.. „. /Y. Xi_i„, . \2 d " 1 nr 2 



ie[d-l] 



(l-6d-i)-p/ d _ a (w i:d - 1 a (i _i) + 6 d _i-p/ d _ a ((ar' , - 1 a (i _ 1 )' 4 nr )) -p^Cay) 

2 d " 1 nr 2 



" lE^^H 5^g{3, (^ 1 ad-i) i)BrJ / 2 )p^ 1 (a«i) else 
We now show that the generator can be efficiently computed. 



P£ d -i(<Xd) 



□ 



Lemma 3.18 (Efficiency). Assume the setup of Construction 3.13, For b £ {0,1}^, Q d ^ : 
jpid+l] jpI-D] j S computable by a depth d + 1, width r 2 , read-once oblivious ABI^ in the vari- 
ables a, of degree < Dnr 4 . 

Proof. For j 6 [d + 1], and c G {0, 1} define A i>c £ ¥[0^*^, by 



(Al)4-!A(«i) :=P4_i((w*Oi) ) 



for £i-i,£i £ [r 2 ], where £_i and ^ are ranged over as well, and pt_ x is still the identity polynomial. 
Then we have that 

9dj&Q = e n (c 1 - + ^ •^_ 1 ((^«,) 2inr2 )) 'ft-.M 

«o,...,^_ieIr 2 ]ieId] 

E II (C 1 ~ 6i ) ' ^.oCotJ/i-iA + 6 * " ^,1(^)^-1,^)) • Ai.oCa^-iA 

£ ,-,^-ie[r 2 ] 



[J ((1 - h) ■ Aifliati) + h ■ A iA ( ai )) 



^4<i,o(ad) 



0.0 



by the properties of matrix multiplication. By Lemma 3.1 we see that this matrix product can be 
computed by a width r 2 , depth d + 1, read-once oblivious ABP, and the degree bound follows from 
ILemma 3.161 □ 

We now turn to proving that our construction is indeed a generator for read-once oblivious 
ABPs. In particular, we first show the generator preserves span. 



6 That is, we are constructing a generator for read-once oblivious ABPs, and each coordinate is computed by an 
read-once oblivious ABP on many fewer variables. It is unclear at the moment whether this is amenable to recursion, 
since read-once oblivious ABPs are not closed under composition. 
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Lemma 3.19 (Span preserving). Assume the setup of Construction 3.13. Let Mr G F[xr]^ x ^ 



for b G {0, 1}M be of degree < n. Then, denoting a for the vector of variables olq, • • • , ctd, 



span \ J] M b ( x b) 

,6e{o,i}M ,__ L 



span J] M g (^(a))l 
Ue{o,i}M J 



(3.20) 



Proof. We prove the claim by induction on d. 

d = 0: Observe that for c? = 0, {0, 1}M = {A}, where A is the empty string. Further, we have 



that Qq x : F — )• F is simply the identity map, by definition of p£_ t . Thus, Equation 3.20 is stating 
that, 

span{M A (x A )} a , AeF = span{M A (£ ,A(ao))} ao eF = span{M A (a )} aoe F 
and is thus trivially true. 

d > 0: We will split the span into the multiplication of two different spans, and then appeal to 
Lemma 3.7| to merge these two spans. 



Specifically, denote 



and for c G {0, 1} denote 



M:=\ J] M b( x b) 



Mc ■=< II M ic (xr c ) 

Ue{o,i}N-il ) s&lD/n 

Similarly, for c G {0, 1}, and a being the vector of variables oti for i G |[d — 1], denote 
M':=«j JJ Mr(Q dS {a,a d ^a d )^ 

aa d _ 1 a d e¥^ d + 1 ^ 

and 

| II M bc(^d-l,bc^ a d-^ ( 
It follows from definition that M = Mo -Mi, and by induction, we have that span M c = spanA^- 



Thus, Lemma 3.2 implies that span M = span(span(A^' ) ■ span(A4' 1 )) = span(.A/f • M'i). Observe 
that span M C spanTW = span(A^ • M\), so the claim will follow from showing that span(7W • 
Mi) C spanA-f. 

We now show that span(_A/f • M'i) Q spanAf'. We will prove this claim for each fixed value 
of the variables a. Thus, the matrices in each of M' c are parameterized only by the variable atd—i- 
Further, each matrix in M' c is the product of D/2 matrices each of the form M(/(ay_i)), for 
M G F[ay_i]" x ^ of degree < n and / G F[q^_i] of degree < r 2 (where the degree bound is by 
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Lemma 3.16). As the order of to is > (Dnm) 2 , and |F| > (Dnmr) 2 (taking m = r 2 ) we can apply 
Lemma 3.7 to see that there is some^] value dd-l G F such that 



span<j [J M^g^fc&a^xY) ■ JJ M %1 [Q d _ ljtl {a, a±- X ) 



and rewriting this via Lemma 3.17 



span <; ]J M ?c ^g- c (a,d d _i,a d )^ 

.6e{o,i}l' , - 1 l I ce{o,i} ^ / 



C span 



n 



.6e{o,i}i d - 1 i,ce{o,i} 



o d _i,a d GF 



Thus taking this for each a we get span(_A/f • -M^) C span_A/f' , implying span.M' = spanAI as 
desired. □ 

The next lemma concludes that the span-preservation property shows that we indeed have a 
generator, and can thus construct a hitting set. 



Lemma 3.21. Assume the setup of Construction 3.13. Let G F[xg] MxM for b G {0, 1} M 6e 
o/ degree < n, and let f{x) = ^Obelo 1}M -^%( x fo)) Let SCF, with \S\ > Dn 2 r 4 . Then f = 
iff f Qd = iff /|g d ( S d+i) = 0. 



0,0 



Proof. As / is the (0, 0)-entry of the matrix product Hbefo i}W ^b^ x b^ an< ^ ^his P r °j ec tion operator 
is linear, we see that Equation 3.19 implies that span{/(x)}^ gF [D] = span{/(^(a))} (3gF [d+ii , and 



as these spans are simply constant multiples of the output of /, we see that / = 0iff/o£7 d = 0. 
Next, we observe that for t£ ld+ 1], deg„.(/ o Q d ) < Dn 2 r A using that / has individual degrees 
< n, and invoking the degree bound in Lemma 3.16| Thus, by basic (multivariate) polynomial 
interpolation, / o Q d = iff (/ o £?d)|sd+i = iff /|g d (s<J+i) =0. □ 

We can now prove our main theorem. 

Theorem 3.22 (PIT for read-once oblivious ABPs). Let C be the set of polynomials f : FPl — >• F 
computable by a widthr, depth D, individual degree < n read-once oblivious ABP. If\¥\ > (2Dnr 3 ) 2 , 
then C has a poly (D,n,r)- explicit hitting set, of size < (2Dn 2 r 4 )r ig - D l +1 . 



'Lemma 3.7 shows that there are < (Dnmr) 2 = D 2 n 2 r 6 possible bad values of &d-i, but in Lemma 3.21 
that there are "really" only Dn 2 r 4 many bad values. It is unclear at present how to directly improve Lemma 3.7 



we show 
to 



reflect this better bound. 
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Proof. By |Lemma 3.1[ any such read-once oblivious ABP computes a polynomial / expressible as 
f(x) = (IliefD] ^( x *))(o,o)> where Mi(xi) G F[xj]M x M is of degree < n. Further, we can round D 
up to 2^ lgD \ and by setting Mj(xj) = I for i > D (ie. padding the product with identity matrices 
in new variables), we get that f(x) = (ILep^D!] M i(xi))o,o- 

We now show that there is some element oj E F of order > (2^ gD ~^nr 2 ) 2 , that can be found in 
poly(D, n, r)-time. As there at most d elements such that x d = 1 in F, it follows that there are at 
most < (2Dnr 2 ) many non-zero elements of order < (2Dnr 2 ) 2 , so any enumeration of non-zero 
elements in F will find such an element of large order, if (2Dnr 2 ) 4 < |F|. If (2Dnr 2 ) 4 > |F|, then 
recalling that the multiplicative group of a finite field F is cyclic, we see there is an element g G F 
of order |F| - 1 > (2Dnr 3 ) 2 - 1 > (2^ gD ^nr 3 ) 2 (using that 2D > 2^^), and thus an enumeration 
can also find such an uj. Similarly, we can find r 2 distinct f3j, and can find some set S C F of size 



2 rig£>l n 2 r 4_ With ^ tlie £., s and ^ we gee then ^ b y |L emma 3 2i[ that Gd{S^ lgD ^ +1 ) is a hitting 
set for / and has the desired size. Further, for any fixed s£ ,S'' lgL) l +1 , Q^s) can be computed in 



poly(D, n,r) time as each coordinate is computed by a small ABP, as shown in Lemma 3.18. □ 



We note that Theorem 3.23 is an immediate corollary. 



Theorem 3.23 (PIT for set-multilinear ABPs). Let X = \J i£ ^ D jXi where Xi = {2^j'}je[ n ] be a 
known partition. LetC be the set of set-multilinear polynomials /(Ujgj^jXj) : FH nD l — > F computable 
by a width r, depth D, set-multilinear ABP. If |F| > (2Dnr 3 ) 2 , then C has a po\y(D,n,r)- explicit 
hitting set, of size < (2Dn 2 r A ) Ps^H 1 . 

Proof. By the degree bounds, the Kronecker map Xij <— x\ induces a bijection among the mono- 
mials, and takes the original set-multilinear ABP to an read-once oblivious ABP, where we replace 
the set of variables Xi with the single (new) variable Xj. Structurally, this is still an width r, depth 



D ABP, and as we index from zero, the individual degrees are < n. Appealing to Theorem 3.22 



gives the result. □ 

4 Non-commutative ABPs 

In this section, we show how to do black-box identity testing of non-commutative ABPs in quasi- 



polynomial time. A non-commutative ABP is structurally the same as defined in Definition 1.1 
but we now do not assume the variables commute. Formally, it is an ABP over the ring F{x}, of 
polynomials in non-commuting variables. Nisan [Nis91] first explored this model, noting that any 
commutative polynomial / can be computed non-commutatively (for example, by the complete 
monomial expansion) and thus non-commutative models of computation form a restricted class of 
computation we can explore. In the same work, he proved the first exponential lower bounds in 
the non-commutative ABP model for the (commutative) determinant and permanent polynomials. 
Later, Raz and Shpilka [RS05] gave a polynomial time PIT algorithm in the white-box model for 
this class. 

In both of the above works, non-commutativity can be seen as "syntactic" in the sense that 
one can treat the variables as formally non-commutating free variables, and one doesn't seek to 
substitute values for these variables. However, black-box PIT of non-commutative polynomials in 
¥{x} by definition requires such a substitution. Such a substitution will occur in some ring TZ 



that is an F-algebra (that is, there is a (commutative) ring homomorphism_J F — > TZ). Clearly, 7Z 

8 We need this homomorphism to make sense of how elements of F act on elements of TZ. That is, when evaluating 
a non-commutative polynomial / in ¥{x} on elements of TZ, we replace the coefficients in / with their images under 
this homomorphism. 
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must be non-commutative in order to witness the non-identity xy — yx in F{x}. Trivially, we could 
choose U 1Z = ¥{x} n and get a black-box PIT algorithm that only requires a single query, but this 
simply pushes the complexity of the problem into the operations in 1Z. That is, one would want 
the black-box PIT queries to be efficiently implementable given white-box access to the circuit. As 
such, it seems natural to ask that 7Z is a finite dimensional F-vector space, and that the number 
of dimensions is polynomial in the relevant parameters. Further, another natural restriction is to 
take 1Z = Fl m l x I m l, the algebra of matricea^J and that is what we do here. 

However, once we have moved from evaluations in F{x} to evaluations in 1Z, there is the concern 
that we have lost information, in that / G ¥{x} could vanish on all possible evaluations over 1Z. 
Note that this is also a problem in the commutative case, as in the standard example of the 
polynomial x 2 — x G ^2[x] which vanishes over the field F2. In the case of matrices, which is the 
ring we shall work over, there are such identities given by the Amitsur-Levitzki theorem JAL50J: 
the polynomial ^2 ae g 2m sgn(cr) I^spm] Ar(i)> where S2m is the symmetric group on 2m elements, 
vanishes on every choice ofmxm matrices Ai, over any field. 

However, recall that in the commutative case, (multivariate) polynomial interpolation states 
that for a polynomial / G F[x] of (total) degree < d, f cannot vanish on all evaluations over F as 
long as |F| > d. Extending this, the Schwartz-Zippel lemma shows that if |F| 3> d then / has a very 
low probability of vanishing on a random evaluation over F. This result, applied via a union bound 
in a probabilistic argument, shows that efficient black-box PIT is (existentially) possible for small 
ABPs. In almost direct analogy, the Amitsur-Levitzki theorem shows that polynomials of (total) 
degree < 2m in ¥{x} cannot be identities over 

FH X H. Bogdanov and Wee |BW05| observed 

that this result, in combination with the Schwartz-Zippel lemma, show that if |F|,m 3> d then a 

non-commutative polynomial / has a very low probability of vanishing on a random evaluation over 
F [m]xlm]_ And th 

as in the commutative case, this establishes existence of small hitting sets for 
non-commutative polynomials computed by small ABPs. Given this existence argument, we now 
proceed to construct quasi-polynomial sized, explicit hitting sets, for non-commutative ABPs. 

We proceed in a fashion similar to Bogdanov and Wee [BW05;, by giving a family of ma- 
trices with entries consisting of commutative variables, such that any non-zero non-commutative 
polynomial / over these matrices induces a non-zero family of commutative polynomials over the 
commutative variables. Further, we show that if / is computable by a small non-commutative 
ABP, then the resulting commutative polynomials are computable by small set-multilinear ABPs. 



We can then appeal to the hitting set for this class of polynomials, shown in Theorem 3.23. We 
first illustrate the construction of our matrices. 

Construction 4.1. Define (p : F{xj} ig [ n j — > F[xij] ig | n ] j e pj to be the unique F-linear map defined 
by ip(l) = 1, and 

Lp[xi 1 Xi 2 ' ' ' X{ d ^) . — Xi 1 ^\Xi 2 ,2 ' ' ' •£ , i ( i,d ■ 

For i G InJ, D > 0, define the upper-triangular matrix Xi t £> G (F[xj i j]j e N)"" D+1 " >< ^" D+1 ^ by 



ld He- 




Define the vector X D := (^,D)ieM' 

9 The relevant homomorphism F — > F' m " x ^ m ' maps a h-> ai, where I is the identity matrix. 



20 



Thus, pictorially, we have that the matrices X^jj look as follows. 



X, 



LD 



















x i>2 

























••• 













••• 








Xi,D 





••• 












These matrices are quite similar to the matrices used in [BW05J (see their Lemma 3.2), but they 
achieve a smaller dimension (in fact, matching the best possible, as seen by the Amitsur-Levitzki 
theorem). However, their construction does not seem to give the reduction to set-multilinear 
computation that we need. We now establish properties of our construction. In particular, we 
relate the evaluations of / on the matrices X{, to the commutative polynomial <p{f)- 



Lemma 4.2. Assume the setup of Construction 1^.1 , Let f E ^{^j}ie[ri] be a non- commutative 
polynomial. For D > £ > 0, 

(f(X D ))o,i = <p(H t (f)) . 



Proof. By F- linearity (of ip, Hg(»), and (*)oi), it is enough to prove the claim for monomials 
fix) = Xi ± Xi 2 ■ ■ ■ Xi d for any d > 0. For d = 0, the only monomial is the identity matrix I, for which 
the claim is clear. 

Now consider d > 0. Observe that 

{ x n ' ' ' Xi d ) ,£ = ^2 {X il )e 0t e 1 (X i2 )£ l! e 2 ■ ■ ■ (X id )e d _ lt £ d 

i =o,e d =e 
fi,..,4_ie[D+i] 

and by construction, we see that if + 1 7^ lj then (Xij)^.^^ = 0, and so, 

= XT ( x h)io,ti( x ^)ii,i2---{Xi d )t d _ lt i d 

e =o,e d =e 
e u ...,e d - 1 eiD+ii 

+ J ^11,1 ' ' ' Xi d) d d — £ 

[0 else 
= ip(H e (x h ■ ■ ■ x id )) 

as desired, where (f) follows from the observation that the only way a sequence of length d+1 can 
start at and end at £, increasing 1 at each step, is if d = £, and £j = j. □ 

We now use this to give the black-box reduction from non-commutative polynomials to set- 
multilinear polynomials. 



Lemma 4.3. Assume the setup of Construction 4-1 ■ Let f E F{xj}j g | n j be a non-commutative 
polynomial of degree < D. Then for each £ G \D + \\, the (commutative) polynomial (/(-Xr>))o,f 
is set-multilinear, and f = iff for all £ G [.D + 1], (f(X£>))o,t = 0. Consequently, f = iff 

f(X D ) EE 0. 
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Proof. As deg(/) < D, it is clear that / = iff for all I £ [D + lj, H e (f) = 0. Further, we see 
that tp is a bijection, so Hg(f) = iff (p(Hg(f)) = 0, and that (p applied to any homogeneous 



non-commutative polynomial yields a set-multilinear polynomial. Finally, we use Lemma 4.2 to see 



that <p(H e (f)) = iff {f(X D )) 0/ = 0. It follows then that /(f) = iff f(X D ) = 0. □ 

Thus, this lemma says that to do black-box PIT of non-commutative polynomials, it is enough 
to do black-box PIT of set-multilinear polynomials, in order to hit each of the (/(-X"d))o,£- However, 
in order for this reduction to be useful, we must show that (/(^Cd))o/ is n °t only set-multilinear, 
but is also computed by a small computational device, as otherwise no small hitting set may exist. 

In the most general model, that of non-commutative circuits, it is not clear there is a simple 
bound on the complexity of (f(XD))o,£ hi terms of the complexity of /, as circuits have no global 
"order" in the computation. That is, in a non-commutative circuit each multiplication is ordered, 
and this local order is fixed by the circuit. However, as parts of the circuit may be reused, a 
partial computation in the circuit can appear in many different parts of the final computation. 
Concretely, in the circuit of the repeated squaring computation ((x) 2 ) 2 , we cannot assign "x — > x{" 
in any way compatible with the fact that tp(x 4 ) = X1X2X3X4. Thus, it is not clear how to convert a 
non-commutative circuit for / into a circuit for ip(Hg(f)). 

However, for weaker models of computation such as ABPs, there is such an notion of ordering 
as each multiplication proceeds from the source to the sink, in a single direction. This allows us 
to relate the ABP complexity of a (non-commutative) polynomial / to the ABP complexity of the 
(commutative) set-multilinear polynomial (p{Hi(f)). To do so, we first cite the following lemma, 
implicit in Nisan |Nis91| (see also Lemma 2 in Raz-Shpilka [RS05p^|. 

Lemma 4.4 (Nisan, |Nis91J). Let f € F{x} be a non- commutative polynomial computed by a non- 



commutative ABP of depth D and width r. Then foi 1 < £ < D, Hg(f) it computed by a 



non-commutative ABP of depth I and width < r{D + 1). Further, each edge in this ABP is labeled 
with a homogeneous linear form. 

We now show that for ABPs as the above lemma outputs (those with only homogeneous linear 
forms on the edges) there is a clear connection between the ABP complexity of / and </?(/)■ 

Lemma 4.5. Let f £ ¥{x} be a non-commutative polynomial computed by an ABP A, whose edges 
are labelled with homogeneous linear forms. Let A' be the ABP with the same structure as A, but for 
each edge from layer j — 1 to layer j with label X^ie[n] a i x i> we replace the label with Xyjefn] a i x i,j- 
Then A' computes <p(f), and A' is a set-multilinear ABP. 

Proof. By linearity, it is enough to prove the claim for a single path in the ABP A. The path 
computes the product of its labels, and thus yields the product of linear forms 

9 = IIE a ^ Xi = Yl ! II )( TI- r '. ! • 

3 iS[n] *ftGM 3 j 

Similarly, the same path in A' computes 

II Yl = (Yl a i 3 ,o)(Yl x ij,j) 

3 ieH in£[n] 3 3 



10 In fact, the version that we use is slightly stronger than the version in |RS05| . but the proof is the same and the 
only difference is that we use a less wasteful analysis, which is obvious from the proof there. 

11 The case when I — does not fit into this technicalities of this lemma, so will be treated separately in what 
follows. 

12 In fact, these ABPs can efficiently computed given the original ABP, but we do not use this fact. 
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which equals <p(g) by linearity, as desired. Finally, we see that the edges from layer j — 1 to layer 
j only involve the variables x,j, so the ABP is indeed set-multilinear. □ 

Combining the above results, we can now give the full reduction. 



Theorem 4.6. Assume the setup of \Construction 4-1 ■ Let % be a hitting set for set-multilinear 



(commutative) polynomials with the (known) variable partition \Jj e ^ D j{xij} ie ^ n j computed by set- 
multilinear ABPs of width < r(D + l) and depth < D. Define W C (f[ d+1 M£>+iJ)M oy replacing 
each evaluation point in % with the corresponding evaluation point defined by the matrices Xr). 
Then H is a hitting set for non-commutative ABPs of width r and depth D. 

Proof. Let f be computed by a non-commutative ABPs of width r and depth D. Then, by 



Lemma 4.3 



/ = iff for all £ G [D + lJ, the set-multilinear polynomial (/(^d))o/ = 0. If 
£ = 0, then we can observe that as the Xi are strictly upper-triangular, it happens that (/(Xd))o,o 
is constant for any setting of Xd (and in fact equals the constant term of /), so it is then clear 
that {f{X D ))o,o = iff ( f(X D ))o, y H , = as \n'\ > 0. 

If £ > then we use Lemma 4.2| an d Lemma 4.3 to see that (/(Xd)) ^ = ^p(H^(f)) is a set- 
multilinear polynomial, and Lemma 4.4 and Lemma 4.5| imply that ip(Hi(f)) is computed by a width 



< rD, depth < D set-multilinear ABP, and thus (f(X D )) 0/ ee iff (f(X D )) 0)i \ ^ d£W =0. □ 

Plugging in our hitting set for set-multilinear ABPs from |Theorem 3.23[ we obtain the following 
corollary. 

Corollary 4.7 (PIT for non-commutative ABPs). LetMC be the set of n-variate non- commutative 
polynomials computable by width r, depth D ABPs. If |F| > (2(D + l) 5 nr 4 ) 2 , then J\fC has a 
poly (D,n,r)- explicit hitting set over (D + l) x (D + l) matrices, of size < (2(D + l) 4 n 2 r 3 )^ lg£) ^ +1 . 

Remark 4.8. The use of (D + 1) x (D + 1) strictly upper-triangular matrices in the above results is 
optimal, in the sense that if we used m x m strictly upper-triangular matrices for m < D + I then 
no such result is possible, as any such matrix X has X D = 0, and the monomial x is computable 
by a depth D ABP. 



5 Diagonal Circuits 

The model of diagonal circuits was first defined by Saxena |Sax08j in order to better understand 
depth-4 circuits. Much previous PIT research focused on small-depth circuits with restricted top 
fan-in. In contrast, the diagonal model allows unbounded top fan-in, but each multiplication gate 
only has few distinct factors (but the factors can have high multiplicity). 

To ease reading we shall make use of the following shortened notation. Given a vector e G N n , 
we denote |e|i := e\ H V e n , |e|oo = maxj e«, |e| x = TLeM^ 1 + e *)' and e ! = • • • e n \. 

For depth-3 diagonal circuits, each multiplication gate has the form £(x) e , for affine linear forms 
£i, where |e| x is bounded by poly(n). Saxena's techniques also extend to the case when the £{ are 
sum of univariate polynomials, which is the depth-4 diagonal circuit model. We now give a formal 
definition. 

Definition 5.1 (Diagonal Circuits). A diagonal depth-4 circuit has the form $ = Yli=i^i' 
where each product gate is of the form *$>i = P^ % , and each Pij(x) is a sum of univariate 
polynomials, Pjj = Ylm=i 9i,j,m(x m ). The syntactic degree of the polynomial computed by the 
circuit is s-deg(<l?) = maxj deg(^j) ; where deg(^j) = • ey deg(Pjj) (i.e. the syntactic degree of 
<]? is the maximal degree of a multiplication gate in &). 
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Saxena |Sax08] proved the following theorem, establishing a white-box PIT algorithm for diag- 
onal depth-4 circuits. 

Theorem 5.2 (Saxena, (SaxOSj). There is a deterministic algorithm that, given as input a diagonal 
depth-4 circuit $ ; runs in time poly(n/c • s-deg(<£), maxj e ny |ej| x ) and decides whether <I> = 0. 

We give a black-box PIT algorithm for diagonal depth-4 circuits whose running time is quasi- 
polynomial in the runtime established by Saxena. Saxena gave a white-box reduction from diagonal 
depth-4 circuits to non-commutative PIT over algebras, and claimed that the Raz-Shpilka [RS05 
algorithm can be extended to handle this case. Our result uses a poly-time black-box reduction 
from diagonal depth-4 circuits to read-once oblivious ABPs. We follow Saxena's duality idea, but 
"pull" the complexity of the algebras he works with into the ABP itself. We can then apply our 
hitting sets on read-once oblivious ABPs, which incurs the quasi-polynomial overhead. 

Our reduction will be based on the duality idea of Saxena [Sax08j (see his Lemma 2). However, 
the prior duality statements were cumbersome over fields of positive characteristic. Saxena jBax08 
only alluded briefly to this case, and later Saha, Saptharishi and Saxena [SSSllj gave a description, 
arguing that duality holds if one can move from Z p to Z p jv for some large N. This is not suitable for 
black-box reductions, as we cannot change the characteristic in a black-box way. Thus, we develop 
a unified duality statement that works for all fields, and also works in the black-box setting. This 
duality improves on that of Saxena [Sax08j by examining individual degree as opposed to total 
degree, and using this technical improvement we can extend the result to positive characteristic by 
using the Frobenius automorphisms. 

We will denote the exponential power series as exp(x) = Yle>o fn anc ^ * ne truncated exponential 



by |_expj e (j;) = Yle=o IP Further, recall our meaning, stated in Section 2, for extracting coefficients 
of polynomials, and its meaning when we take coefficients "over F[xJ[y]" (as opposed to " over 

man 

We first establish duality over Q, treating the coefficients c of our polynomials as formal vari- 
ables. 

Lemma 5.3 (Duality over Q). Let ^ = P e G Z[x, c], where Pj(x,c) = ^2 m gj,m(x m ,c)- Then, 
taking coefficients in Q[x, c[[z\, 

^* = f II (n LexpJ e . g jtm (x m , S)zA j j . (5.4) 

Proof. We will work over the power series Q[x, c, z\. In particular, for / € Q[x, c], we have the 
identity 

1 

-,f{x, cf = €. z e exp(/(x, c)z) . 
e! 

Note that this is a formal identity, that is, there is no notion of convergence needed as each coefficient 
in the power series of exp(/(x, c)z) is the sum of finitely many non-zero terms. It follows then that 

^T* = II ^zV e MPj(x, c] Zj ) = tgg JJexp(P i (f, c)Zj) 
j 3 

using the identity exp(/ + g) = exp(/) exp(g), which formally holds in Q[x, c, z\, 
= Cg^exp ^2Pj(x,c)Zj = Cggexp gj,m(x m , c)zj 
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reversing the order of summation, 



= € g g exp I ^2 ^2 9j,m(x m , c]zj 

\ m j 

= ]J II 6XP (9j,m{ X m, c)Zj) 
m j 

and now discarding terms of degree > ej in Zj, as that do not affect the coefficient of z 5 , 

= II II L ex PJ i9j,m {x m ,c)Zj) □ 
m j 

Even though the above proof used the exponential function, which has no exact analogue in 
positive characteristic, the final statement is just one of polynomials, and thus transfers to any 
field of sufficiently large characteristic. In particular, the characteristic must be > |e|oo, the 
maximum exponent. We now transfer this duality to any field. We will abuse notation and treat 
the characteristic zero case as working over characteristic p, where p = oo. Note that the "base-oo" 
expansion of any integer is just that integer itself, followed by zeroes. 

Lemma 5.5 (Duality over any F). Let ¥ be a field of characteristic p (for characteristic zero, we 
abuse notation and use p = oo). Let ^ = P e 6 ¥[x], where Pj(x) = ^ m 9j,m(xm)- 

Write ej in its base-p expansion, so that ej = ^fce[n ] a j,kP k w tth ®j,k £ [pj- Denote Qjfiffl := 

Em 9j,m(xm) pk and Q s := Uj Uk 0$? ' Then > P§ = & and 



i } Q s = ( II f II L ex pJ aj , fc f E*>(*nO p % ] ] 

\ m \j,k \ j,k ) ) 



(5.6) 



and \a\ x < |e|: 



Proof. P e = Q a : This follows from the Frobenius automorphism, that (x + y) p = x p +y p for any 
k > 0, in any field of characteristic p. That is, 

p* = n Fp = n P f ^ aj ' kpk =nn P ;^ k = n H(pfr^ 

j j j k j k 

/ / \P k \ aj - k 

= nil [J29lm(Xr, 
j k \\m 



by the Frobenius automorphism, 

/ \ a j,k 



j k \ m / j k 



5.6|): Note that the identity of Lemma 5.3 only involves rational numbers whose denominators 



< |e|oo) the maximum exponent. Thus, as \a\oo < p by construction, Lemma 5.3 is also an identity 



over Z p [x, c] when applied with the exponent vector a, and any g polynomials. In particular, we can 
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substitute values for the constants c, implying that we can take the g polynomials to be polynomials 
in x with constants from any field extending Z p , such as F. Thus, the identity can be applied to 
Q a over F, yielding the result. 

\a\x < [gjxj As |e| x = T[j(l + ey) and \a\ x = Ylj Ylkeln-jQ + a j,k)i it is sufficient to prove that 
rifc(l + a fc) < 1 + e, for e = ^ fc with a& G |p]. To see this, first define T := {e' : < e' < e} 
and note that \T\ = 1 + e. Second, let 5 = {e' : e' = J2 k a' k p k , < a' k < a^}. Observe that S C T, 
and |5| =n fc (l + afc), son fc (l + afc) = 1^1 < |T| = 1 + e. " □ 

To use this lemma, we first need a structural result about ABPs. In this lemma we broaden 
the notion of an ABP to have arbitrary labels from a commutative ring lZ[x], and will specialize 
this to our setting later. 

Lemma 5.7. Let A be an layered ABP of depth D and width < r, with edge labels in 1Z[z\ for some 
commutative ring 1Z, computing an n-variate polynomial f G TZ[z\. Then for any e G N n , there is 
an ABP A' , with edge labels in 1Z, of depth D and width < r ■ |e| x computing £gg(f). 

Further, the edge labels between layers i — 1 and i in A' occur as coefficients of the edge labels 
between layers i — 1 and i in A. 

Proof. Define S := {a : < a < e, a £ N n }, where we impose the natural partial order on N n . 
Clearly |5| = |e| x • Index the nodes in A by [r] x \D + 1], so that the i-th layer in A consists of the 
nodes [r] x {%}. Define the nodes of A' to be [r] x S x \D + lj, so that the i-th layer of A' consists 
of the nodes [r] x S x {i}. 

For each edge (v x (i — 1), v' x i) in A with label g £ 7Z[z\, and each a, a' G S with a < a' , define 
the edge label (v x ax (i — 1), v' xa'xi) to be £^a'-a(g)- Letting the source of A be denoted sxO, 
and the sink denotes t x D, we define the source of A' to be s x x and the sink to beixexD. 
After removing nodes that belong to no source-sink path, it is clear that A' is a layered ABP, with 
depth D, width < r • |e| x . Further, the edge labels between layers i— 1 and i in A 1 are coefficients 
of the edge labels between layers i — 1 and i in A as desired. It remains to show that A' computes 
as desired, which follows from an induction on layers. Specifically, one can see that the paths from 
the source of A' to v x a x i compute €^s(f v xi), where f VX i is the polynomial computed by the 
paths from the source in A to v x i. □ 

We now apply this structural result, along with Saxena's dual form, to show that depth-4 
diagonal circuits can be computed by small read-once oblivious ABPs. 

Lemma 5.8. Let F be any field. Let f(x±, . . . , x n ) be computed by a diagonal circuit $ = Y2i=i 
where = P? % ■ Denote the syntactic degree of f to be s-deg(/) = maxj s-deg('Fj). Then there is an 
read-once oblivious ABP A computing f with variable order x\ < ■ ■ ■ < x n , with depth n and width 
< k ■ max ig [fc] |ej| x such that the edges in A are labeled with polynomials with degrees < s-deg(/). 

Proof. Let p denote the characteristic of F, with the convention that p = oo for characteristic zero 



fields. By the dual form, Lemma 5.5, we have that for each i, 

*i = (/i) 

where Si is the base-p expansion of e*j (and |aj| x < |e*j| x ), fi G F[5;][z|, and fi is a product of n 
terms, where the j-th term only involves Xj amongst the x variables. It follows that fi can be 
computed using a width-1, depth-n ABP in F[x][z|, where the labels from layer j — 1 to j only 
involve Xj amongst all x variables. 
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Applying Lemma 5.7 to fi, taking 1Z = ¥[x], we see (after a scalar multiplication by aj!) that 



i&i is computable by an ABP Ai, which depth n, and width < \Si\x < |ej| x - Further, Lemma 5.7 
establishes that the labels in Ai from layer j — 1 to layer j come from the labels (by taking 
coefficients) on the edges from layer j — 1 to layer j in the ABP computing the product fi in 
F[x][i]. Thus, we see that the labels from layer j — 1 to layer j in are polynomials in Xj, and 
have degree at most s-deg^j), and thus Ai is an read-once oblivious ABP. 

Now, observe that the read-once oblivious ABPs for the *&i can be summed by merging all 
sources and merging all sinks, and that the resulting ABP A is read-once oblivious, as the variable 
ordering of x is consistent amongst the Ai. It follows that A has the desired properties. □ 

As the above lemma is constructive, it follows that white-box access to / implies white-box 
access to the read-once oblivious ABP A, and thus we could run the white-box algorithm of Raz- 
Shpilka [RS05] to derive a white-box PIT algorithm, in order to rederive Saxena's result (over any 
field). However, as we also have black-box PIT algorithms for read-once oblivious ABPs (and the 
above uses the fixed variable order x\ < ■ ■ ■ < x n ), and the above reduction does not need access to 
/, we can combine these results to deduce the following black-box PIT result for diagonal depth-4 
circuits. 

Theorem 5.9 (Black-box PIT for diagonal circuits). Let ¥ be a field of arbitrary characteristic. 
Let T>C be the set of n-variate polynomials computable by diagonal depth-4 circuits, that is, of the 
form = X^«e[fc] *> where |e*j| x < e and Pij is of degree < d. Then if |F| > (2mi/c 3 e 4 ) 2 then DC 
has a poly (n,k,e,d)- explicit hitting set of size < (2nd 2 k i e e )^ lgn ^ +1 . 



Proof. From [Lemma 5.8 we get that any $ G T>C can be computed by an read-once oblivious ABP 



on variable order x\ < ■ ■ ■ < x n , of depth n, width < ke, with each edge la bel being a univ ariate of 

to finish 
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degree < s-deg^). Noting that s-deg(P^) < |e*j|i • d < |e*j| x • d, we invoke 
the claim. □ 

Remark 5.10. We remark here that the concurrent work of Agrawal, Saha and Saxena [ASS12 
also obtain a quasi-polynomial hitting set for diagonal depth-4 circuits (when they assume each 
product gate is over a constant number of factors), but only over large characteristic, as they 
rely on the duality statements of Saxena [Sax08 (and later exposited by Saha, Saptharishi, and 
Saxena |SSSllj ) which only hold over large characteristic. Over small characteristic, fSax08j and 
[SSS11] gave more cumbersome duality statements working over prime-power characteristic, and 
[ASS 12] do not extend their work to this case. 

Our duality statements work any characteristic, and as such can show that the work of [A~SS12 
also implies results for diagonal circuits over any characteristic. In particular, instead of using 



Lemma 5.5 to construct an ABP, one can interpolate the coefficient in Lemma 5.5, and this 
can be done in depth-3, although the resulting formula is inherently larger than the resulting ABP 
would be. 



5.1 Semi-diagonal Depth-4 circuits 

In Saha, Saptharishi, and Saxena [SSS11] the model of semi-diagonal depth-4 circuits was intro- 
duced as a small extension of diagonal depth-4 circuits. The modification is that one is allowed to 
multiply each product gate = P e by an arbitrary monomial. [SSSll] used that the duality result 
of Saxena [Sax08j can also be shown to work in this setting. The concurrent work of Agrawal, Saha 
and Saxena [ ASS12| thus present their results for semi-diagonal depth-4 circuits, as opposed to 
just diagonal depth-4 circuits. For ease of comparison, we also state our result for this model. As 
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multiplication by a single monomial in Lemma 5.5 preserves the variable disjoint product structure 
(but increases the degree), it follows that we can still convert semi-diagonal circuits to read-once 

Tius, as the details are the same, we conclude the 



oblivious ABPs, as stated in Lemma 5.8 
following result. 

Theorem 5.11 (Black-box PIT for semi-diagonal circuits). Let ¥ be a field of arbitrary charac- 
teristic. Let SVC be the set of n-variate polynomials computable by semi- diagonal depth-4 circuits, 
that is, of the form $ = X^ie[fc] m »(^)-^fS where mi{x) is a monomial of degree < d, |ej| x < e and 
Pij is of degree < d. Then if |F| > (4ndk 3 e 4 ) 2 then SVC has a poly(n, k, e, d) -explicit hitting set 



of size < {8nd 2 k 4 e (i )^ +1 . 



6 Discussion 

This work closes some gap in our understanding of white-box PIT vs. black-box PIT by transferring 
algorithms that used the partial derivative technique to the black-box world. The recent work of 
[ASS12] made another significant step by considering set-multilinear formulas (of small depth) 
where the partition is unknown. It will be very interesting to try and combine these two works to 
obtain a PIT algorithm for set-multilinear ABPs when the underlying partition is not known. 

Another interesting goal is to truly close the gap between white-box and black-box. Specifically, 
all black-box algorithms for the models studied in this paper (as well as in [ASS12 ) run with a 
quasi-polynomial overhead over the run-times of the corresponding known white-box algorithms. 
Ideally, this overhead could be made polynomial. 
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